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Phosphorylation of the coronavirus nucleoprotein (N protein) has been predicted to play a role in RNA 
binding. To investigate this hypothesis, we examined the kinetics of RNA binding between nonphosphorylated 
and phosphorylated infectious bronchitis virus N protein with nonviral and viral RNA by surface plasmon 
resonance (Biacore). Mass spectroscopic analysis of N protein identified phosphorylation sites that were 
proximal to RNA binding domains. Kinetic analysis, by surface plasmon resonance, indicated that nonphos¬ 
phorylated N protein bound with the same affinity to viral RNA as phosphorylated N protein. However, 
phosphorylated N protein bound to viral RNA with a higher binding affinity than nonviral RNA, suggesting 
that phosphorylation of N protein determined the recognition of virus RNA. The data also indicated that a 
known N protein binding site (involved in transcriptional regulation) consisting of a conserved core sequence 
present near the 5' end of the genome (in the leader sequence) functioned by promoting high association rates 
of N protein binding. Further analysis of the leader sequence indicated that the core element was not the only 
binding site for N protein and that other regions functioned to promote high-aifinity binding. 


Avian infectious bronchitis virus (IBV) is a member of the 
Coronaviridae (order Nidovirales ) (9), members of which are 
enveloped viruses with single-stranded, positive-sense RNA 
genomes that are 5' capped and 3' polyadenylated (30, 63). 
The 5' two-thirds of the coronavirus genome encodes the rep- 
licase gene producing two polyproteins, Repla and Replab, 
the latter resulting from a —1 frameshift (7). The remaining 
proteins, which include the nucleoprotein (N), are expressed 
from a nested set of subgenomic mRNAs (sgRNAs) that are 
produced via a discontinuous transcription mechanism (6, 30). 
Each of these sgRNAs has a short nontranslated leader se¬ 
quence (64 nucleotides for IBV) derived from the 5' end of the 
genome. Present within the leader sequence is a consensus 
sequence, which we have termed the transcription-associated 
sequence (TAS) (24). The TAS contains a conserved core 
motif, which in the case of IBV is CUUAACAA, which is also 
located in the genome, proximal to the start site for each 
sgRNA. For different coronaviruses, the core sequence varies 
and can be present more than once per TAS. 

N protein, the virus RNA binding protein, is one of the most 
abundant viral proteins in an infected cell (31). Several func¬ 
tions have been postulated for the coronavirus N protein 
throughout the virus life cycle (31); primarily, it complexes 
with the genomic RNA to form a ribonucleocapsid structure 
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(17) and associates with the M protein (19, 39) to form the viral 
core (48). While N protein is required in trans to rescue the 
full-length clone of IBV (8) and a porcine coronavirus trans¬ 
missible gastroenteritis virus clone (82), it is not required for 
others (1, 71, 72). Certainly, in the case of the rescue of the 
full-length clone of severe acute respiratory syndrome corona¬ 
virus, the presence of N protein increases viral titers compared 
to rescue performed in the absence of N transcript (83), sug¬ 
gesting that N protein may be involved in the efficiency of 
replication but that it is not essential. 

Based on amino acid sequence comparisons, three domains 
have been identified in the murine coronavirus, mouse hepa¬ 
titis virus (MHV) N protein (46), of which the central domain 
(domain II) was identified as a potential RNA binding site (35, 
40) capable of binding both coronavirus- and non-coronavirus- 
derived RNA sequences (35, 68). However, whether this bind¬ 
ing occurs with equal or different affinity is uncertain (14, 35, 
49). N protein has been shown to associate with several motifs 
on viral RNA, including the leader RNA sequence, with par¬ 
ticular affinity for the core sequence of the TAS (2, 41), se¬ 
quences at the 3' end of the genomic RNA (84), and the pack¬ 
aging signal (37). How these sequences promote N binding is 
unknown. 

Several coronavirus N proteins have been shown to be phos¬ 
phorylated, including IBV, MHV, and transmissible gastro¬ 
enteritis virus N proteins, although the precise sites were not 
identified (31). The role of phosphorylation in the virus life 
cycle is unknown, although the phosphorylation state of N 
protein has been predicted to play a role in determining the 
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TABLE 1. Sequences of oligonucleotides used in this study" 


Oligonucleotide Sequence (5'-3') Polarity 


3LeaderTAS B-ACUUAA GAUAGAUAUUAAUAUAUAUCUAUU ACA CUAGCCUUGCGCUAG AUUUUUAACUGAAC + 

AAUACAGACCUAAAAAGU CU GUUU GAU G 

3LeaderSAT B-ACUUAA GAUAGAUAUUAAUAUAUAUCUAUU ACA CUAGCCUUGCGCUAG AUUUUUAAgaacucgu + 

UACAGACCUAAAAAGUCUGUUUGAUG 

Random90 B-uucaucuucaucacucaaugaaucacuaagaucagucauucauaugugacuagugauauaagaucauucaaaguugauaauacuguuaua + 

Random90TAS B-uucaucuucaucacucaaugaaucacuaagaucagucauucauaugugacuagugaCUGAACAAcauucaaaguugauaauacuguuaua + 

DSL1SL2 B-uucaucuucaucacucaaugaaucacuaagAC ACUAGCCUUGCGCUAG AUUUUUAAuauaagauUACAGACCUAA + 

AAAGUCUGUUUGAUG 

DSLlSL2rand B-uucaucuucaucacucaaugaaucacuaagAC ACUAGCCUUGCGCUAG AUUUUUAAuauaagaucauucaaaguugauaaua + 

cuguuaua 

DSL1SL2TAS B-uucaucuucaucacucaaugaaucacuaagAC ACUAGCCUUGCGCUAG AUUUUUAACUGAACAAUACAGACCUA + 

AAAAGUCUGUUUGAUG 

SLlSL2TASrand B-ACUUAA GAUAGAUAUUAAUAUAUAUCUAUU ACA CUAGCCUUGCGCUAG AUUUUUAACUGAAC + 

AAcauucaaaguugauaauacuguuaua 

SLITASrand B-ACUUAA GAUAGAUAUUAAUAUAUAUCUAUU aucagucauucauaugugacuagugaCUGAACAAcauucaaaguug + 

auaauacuguuaua 

SLlSL2rand B-ACUUAA GAUAGAUAUUAAUAUAUAUCUAUU ACA CUAGCCUUGCGCUAG AUUUUUAAuauaagau + 

cauucaaaguugauaauacuguuaua 

Antileader B-CAU CAAACAGACUUUUUAGGU CU GUAUU GUUCAGUUAAAAAU CUAGCGCAAGGCUAGU GUA 

AUAGAUAUAUAUUAAUAUCUAUCUUAAG 


" The biotin moiety is denoted (B-), the authentic IBV gene 3 sequence is in uppercase, the random sequence is in lower case, and the position of the gene 3 TAS/ 
scrambled/inactive TAS is in boldface type. The sequence that corresponds to predicted SL1 is single underlined, and that corresponding to predicted SL2 is double 
underlined. 


affinity of binding of viral RNA (31, 41), and dephosphoryla¬ 
tion of MHV N protein by cellular phosphoprotein phospha¬ 
tase has been suggested to facilitate the infectious process (36). 
Siddell et al. (64) identified a virion-associated protein kinase 
activity that could transfer an additional phosphate from ATP 
to N protein. 

We investigated the role of phosphorylation of IBV N pro¬ 
tein in RNA binding by using surface plasmon resonance, which 
has not previously been used to study coronavirus N protein 
binding kinetics. To model nonphosphorylated and phosphor- 
ylated protein, we expressed and purified N protein from Esch¬ 
erichia coli and Sf9 cells, respectively. Mass spectroscopic anal¬ 
ysis indicated that the sample expressed in Sf9 cells was 
phosphorylated proximal to RNA binding domains and had a 
phosphorylation pattern identical to that of N protein ex¬ 
pressed in the model cell line for IBV infection. We used either 
an IBV-defective RNA to model the viral genome or syn¬ 
thetic RNAmers resembling the viral leader sequence as target 
RNAs. Our results showed that the phosphorylated form of 
N protein has decreased affinity for random RNA and that 
multiple high-affinity N protein binding sites were present 
on leader RNA. 

MATERIALS AND METHODS 

Biotinylated RNAmers. Biotinylated RNA oligonucleotides used in this study 
were obtained from Invitrogen and purified by high-pressure liquid chromatog¬ 
raphy (HPLC). Their sequences are shown in Table 1 and schematically repre¬ 
sented in Fig. 1. Biotinylated oligo(dT) was obtained from Promega. 

Recombinant plasmids and viruses. To synthesize recombinant IBV N pro¬ 
tein from E. coli, the IBV (strain Beaudette) N protein gene was cloned into 
pTriExl.l (Novagen), creating pTriExIBVN, as described previously (81). To 
synthesize recombinant IBV N protein from insect cells (Sf9), IBV N protein was 
inserted into BacVector 3000 by using pTriExIBVN as a template for homolo¬ 
gous recombination, as described previously (81). The vector places a C-terminal 
His tag on the N protein and has been used to obtain recombinant purified N 
protein (13). pCD-61 contains the cDNA for CD-61, an RNA which can be 
rescued by IBV (15, 47). Vero (simian) cells were maintained in Dulbecco 
modified Eagle medium supplemented with 10% fetal bovine serum. IBV Beau¬ 
dette strain was grown in 11-day-old embryonated specific-pathogen-free domes¬ 
ticated fowl eggs and harvested from allantoic fluid for 16 h postinfection. 


Infection and transfection of Vero cells. Vero cells (seeded at 10 5 in 9.6 cm 2 ) 
were either transfected or infected at 70% confluence. Transfections were per¬ 
formed with 2 |xg of pCI-Neo-N with 8 pi of Lipofectamine transfection reagent 
(Invitrogen) and incubated in Opti-MEM serum-free medium (Invitrogen) at 
37°C for 4 h. The transfection medium was replaced with 2 ml of supplemented 
Dulbecco modified Eagle medium and left for an additional 20 h. Vero cells were 
infected with 1 ml of IBV Beaudette (multiplicity of infection of approximately 
1) and incubated for 1 h at 37°C. The initial inoculum was replaced with 1 ml of 
media and incubated for a further 7 h postinfection. 

Preparation of total protein extracts from Vero cells and Western blotting. 
Vero cells were washed with phosphate-buffered saline, detached by a cell 
scraper into 1 ml of phosphate-buffered saline, pelleted by centrifugation at 
250 X g for 5 min before lysis by addition of 200 pi of cell lysis buffer (20 mM 
Tris-HCl [pH 7.5], 100 mM NaCl, 0.5 mM EDTA, 0.5% NP-40, and complete 
protease cocktail inhibitor [Roche]) for 10 min at 4°C. Samples were centrifuged 
at 10,000 X g for 5 min at 4°C, and supernatants were transferred into a new 
Eppendorf tube and stored at — 80°C. 

Preparation of N protein from E. coli, Sf9, and Vero cells. Expression and 
purification of His-tagged N protein from either E. coli or Sf9 cells has been 
previously described (12, 13). To obtain N protein from mammalian cells, Vero 
cells were transfected with pTriExIBVN as described previously (13), and puri¬ 
fication was identical to that described for N protein from Sf9 cells. Western 
blotting was performed by using the ECL detection kit (Amersham Pharmacia) 
according to the manufacturer’s instructions. Briefly, each sample (3 p,g of total 
protein as quantified by Bradford assay) was separated on a 10% NuPage Bis- 
Tris precast polyacrylamide gel (Invitrogen) in morpholinepropanesulfonic acid 
(MOPS) running buffer. Protein was transferred onto a polyvinylidene difluoride 
membrane (Invitrogen) in Invitrogen transfer buffer for 1 h at room tempera¬ 
ture. The membrane was blocked for 1 h in Tris-buffered saline (TBS) plus 
0.05% Tween 20 and 5% milk. Primary rabbit anti-IBV polyclonal sera (a gift 
from D. Cavanagh, IAH Compton), diluted 1:10,000 in TBS plus 0.05% Tween 
20, was incubated with the membrane for 2 h at room temperature to detect IBV 
N protein. The membrane was washed three times in TBS plus 0.05% Tween, 
and then the membrane was incubated with secondary goat anti-rabbit immu¬ 
noglobulin G conjugated to horseradish peroxidase (diluted 1:1,000 in TBS plus 
0.05% Tween) for 1 h at room temperature. The membrane was then washed 
three times, treated with ECL detection reagents (Amersham Pharmacia), and 
subjected to autoradiography. 

Mass spectroscopy. Recombinant IBV N protein (from E. coli [N E coli ] or Sf9 
[N S fg] cells) was dialyzed into digest buffer (100 mM ammonium bicarbonate, 
2 M urea [pH 7.8]). Proteins were digested by the addition of approximately 5 to 
10% (wt/wt) endoproteinase Glu-C (Sigma). Resulting proteolytic peptides were 
analyzed by online capillary HPLC-mass spectrometry (MS). Samples were pre¬ 
concentrated and desalted by use of a self-packed protein trap, followed by 
separation by use of a 180-p.m-inner-diameter capillary column, self-packed with 
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FIG. 1. Diagrammatic representation of the synthetic RNAmers used in this study to show delineation between IBV and a random sequence. 
Note that SL1 and SL2 are predicted structures only and not yet established by experimental work. 


Hichrom C 18 reversed-phase sorbent (3.5-pm bead size, 150-A pore size). Pep¬ 
tides were eluted from the column by an increasing gradient of solvent B (0.05% 
vol/vol trifluoroacetic acid in 5:95 water-acetonitrile), and solvent A was 0.05% 
vol/vol trifluoroacetic acid in 95:5 water-acetonitrile. The eluent was passed, via 
a U-Z View UV flow cell (LC Packings), to a Quattro II tandem quadrupole 
mass spectrometer (Micromass United Kingdom Ltd.). The mass spectrometer 
was equipped with a commercially available Z-spray source operated in contin¬ 
uous flow nanospray mode. Full-scan mass spectra were acquired in positive-ion 
mode from an m/z of 300 to 2,100 with a scan time of 4 s. The voltage applied to 
the extraction cone was ramped to improve detection of high -m/z ions. Recom¬ 
binant IBV N protein expressed by Vero cells (N Vero ) was digested with trypsin 
and analyzed by HPLC-MS or automated HPLC-MS/MS at Micromass United 


Kingdom Ltd. Peptides were separated on a reversed-phase capillary HPLC 
column (75-pm C 18 Pepmap; LC Packings), and the eluent was passed directly to 
the Z-spray source of a Micromass Q-Tof Ultima API (Waters Corp.) mass 
spectrometer. The mass spectrometer was operated in data-dependent acquisi¬ 
tion mode to generate peptide sequence information automatically, and theo¬ 
retical phosphopeptide ions were added to the include list to ensure that these 
peptides were sequenced. 

Runoff transcription and hybridization of CD-61 RNA to biotinylated oli- 
go(dT). pCD-61 containing the cDNA for the 6.1-kb minigenome of Beaudette 
IBV was made linear by digestion with Notl. RNA with a 3' poly(A) tail was in 
vitro synthesized from pCD-61 by using T7 RNA polymerase (Promega) as 
described previously (47). RNA (2.5 pmol) was denatured in 400 pi of 0.5 X SSC 
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Time (s) 

FIG. 2. Sensorgram showing the binding analyte (nonphosphory- 
lated N protein) to target ligand with different flow rates to investigate 
mass transport effects. N protein (the analyte) at the concentration of 
6.25 nM was injected at flow rate of 5, 15, or 75 pl/min. RU, response 
units. 


(IX SSC is 0.15 M NaCl plus 0.015 M sodium citrate) at 65°C for 10 min. 
Biotinylated oligo(dT) (2.5 pmol) was annealed to the RNA by cooling to room 
temperature. 

Surface plasmon resonance analysis of RNA-protein interactions. Surface 
plasmon resonance provides both kinetic and equilibrium data about intermo- 
lecular interactions and is an excellent tool for the study of RNA-protein inter¬ 
actions (27). In most nucleic acid protein binding studies, RNA is the ligand and 
is therefore anchored to a solid matrix and the protein is the analyte and is 
passed over this material (27). In our experiments, kinetic analysis was per¬ 
formed by using a Biacore 3000 (Uppsala, Sweden). Research grade sensor chips 
coated with streptavidin were obtained from Biacore. Unless otherwise stated, 
the the running buffer (also used for analyte dilution) was HBS-EP (0.01 M 
HEPES, 0.15 M NaCl, 3 mM EDTA, 0.005% surfactant P20 [pH 7.4]). 

To reduce effects attributed to mass transport, low levels of ligand (given in 
relative units [RU]) are immobilized on the solid matrix, coupled with a fast flow 
rate of analyte (27). For example, Stockley et al. (67), in their study of binding of 
RNA polymerase to immobilized DNA, suggested that between 200 and 800 RU 
represented low levels of immobilized ligand, and they used a flow rate of 30 or 
40 pl/min. Likewise, Waysbort et al. (76) immobilized on average 1,000 RU of 
ligand and used a flow rate of 5 |xl/min of analyte in their study of pyrimidine 
tract binding protein. Blaesing et al. (4) immobilized between 100 and 700 RU 
of an oligonucleotide containing a DnaA box motif and a flow rate of 100 pl/min 
in an investigation of the DNA-binding domain of E. coli DnaA protein. In 
contrast, Park et al. (45) immobilized 30 to 50 RU of target RNA and used a flow 
rate of 30 pl/min. In a binding study, Kortt et al. (28) found that, with 1,260 or 
fewer RU of ligand and a flow rate of analyte between 20 and 50 pl/min, no mass 
transport was apparent and experimental data exhibited no marked deviation 
from the 1.1 Langmuir binding model used in the study. Given the apparent 
variation in RU of immobilized ligand used in different binding studies, the 
average RU of immobilized ligand in our study was approximately 500 RU (as 
recommended by BLAcoreAB, application note 301, Bianote). The average for 
CD-61 RNA was 649 RU. For RNAmers, the averages were 526 RU for 31ead- 
erTAS, 490 RU for 31eaderSAT, 488 RU for random90, 488 RU for 
random90TAS, 435 RU for SLlSL2TASrand, 418 RU for SLlSL2rand, 514 RU 
for DSL1SL2TAS, 573 RU for DSL1SL2, 560 RU for DSLlSL2rand, 563 RU for 
SLITASrand, 450 RU for antileader, 477 RU for 31eaderTAS, 614 RU for 
31eaderSAT, 514 RU for random90, and 620 RU for random90TAS. 

Prior to our kinetic analysis, we ran control experiments to assess the influence 
of mass transport on the ligand-analyte interaction and to investigate the use of 
high flow rates of analyte (N protein) to reduce possible mass flow effects. In 
brief, different concentrations of N protein were injected at a flow rate of 5, 15, 
or 75 pl/min over immobilized RNA. As an example, the average binding kinetics 
of nonphosphorylated N protein (at a concentration of 6.25 nM) to target RNA 
are shown in Fig. 2 and kinetic values are given in Table 2. The average K d values 
obtained were 13.3, 8.44, and 6.97 nM, respectively, indicating that there was no 
significant difference between the data from a flow rate of 15 to 75 pl/min. 
Therefore, in subsequent work to study RNA-N protein interactions, and similar 
to that used by Park et al. (45), we used a flow rate of 30 pl/min to minimize the 
influence of mass transport. 


CD-61 RNA ligand was immobilized to the streptavidin chip by injection of 25 
pi of the RNA hybridized with oligo(dT) at a flow rate of 5 pl/min. The kinetic 
interactions between CD-61 RNA and the IBV N proteins were analyzed by a 
6-min injection of the appropriate analyte concentration followed by a 15-min 
dissociation period. The immobilized CD-61 RNA surface was regenerated by 
two 30-pi injections of 2 M NaCl and allowed to stabilize for 3 min before the 
subsequent analyte injection. For the interaction of N proteins with RNA oli¬ 
gomers, biotinylated RNA was diluted to 50 nM with HBS-EP buffer and then 
5 pi of RNA was injected at a flow rate of 5 pl/min. Binding experiments were 
performed at 37°C. Subsequent data were obtained at a rate of 2 points/s. The 
response from the blank lane alone was subtracted to correct for refractive index 
changes due to buffer effects and nonspecific interactions with the dextran matrix 
and presented as sensorgrams of the actual tracings (RU versus time). For all 
experiments, the data were analyzed by using BLAEvaluation software (version 
3.1; Biacore); a statistically significant fit to the binding data was obtained by 
nonlinear regression. For data analysis, we used models for separated KJK d , i.e., 
1:1 (Langmuir) association and 1:1 (Langmuir) dissociation models. After each 
binding experiment, we monitored for mass transport effects by using the kinetic 
analysis available on the Biacore 3000. No mass transport effects were observed 
under the conditions used. To confirm the 1:1 kinetics, we examined the binding 
of N protein to target RNA by immobilizing CD-61 RNA, injecting over 6.25 nM 
N protein for 1, 3, or 20 min, and then examining the dissociation rate, which 
remained unchanged (Fig. 3), indicating that binding was 1:1 and that no sec¬ 
ondary binding occurred. 

RESULTS 

We wanted to investigate the role of phosphorylation and 
viral RNA sequences in the binding of the coronavirus N 
protein to RNA by using surface plasmon resonance. There¬ 
fore, a source of both purified nonphosphorylated and phos- 
phorylated N protein was required. Previous studies have suc¬ 
cessfully used a variety of sources of N protein to study RNA 
binding, including bacterially expressed N protein (41, 84), in 
vitro-transcribed and -translated N protein (35, 49) (which in 
both cases were presumably not phosphorylated), or N protein 
obtained from infected cell lysates (14) (and thus containing a 
mixture of nonphosphorylated and phosphorylated viral and 
cellular proteins). It was previously shown that N protein pro¬ 
duced in Vero cells (the model mammalian cell line used to 
study IBV infection) has identical electrophoretic mobility to 
that produced in insect (Sf9) cells, both of which had slower 
mobility than N protein produced in E. coli (12), suggesting 
that N protein synthesized in Vero and Sf9 cells had under¬ 
gone posttranslational modification. We note that Western 
blot analysis suggested that N protein produced in Vero cells 
(by transfecting cells with pTriExIBVN [13], a plasmid which 
expressed N protein under a PolII promoter) has identical 
mobility to N protein isolated from Vero cells at 8 h postin¬ 
fection (i.e., the virus has undergone one infectious round) 
(Fig. 4). 

While it was not possible to purify N protein from virions, by 


TABLE 2. Average kinetic values for complexes of analyte 
(nonphosphorylated N protein) with ligand RNA to investigate 
the effects of mass transport and the use of different 
flow rates to reduce this effect 


Flow rate of mass transfer 
control ((xl/min) a 

k a (1/ms) 

k d (1/s) 

K a (1/M) 

K d (nM) 

5 

3.51 X 10 4 

4.67 X 1CT 4 

7.52 X 10 7 

13.3 

15 

4.15 X 10 4 

3.5 X 1CT 4 

1.19 X 10 s 

8.44 

75 

5.46 X 10 4 

3.8 X 1CT S 

1.44 X 10 8 

6.97 


a Chi-square test, 1.98. 


Downloaded from http://jvi.asm.org/ on April 4, 2014 by guest 












1168 


CHEN ET AL. 


J. Virol. 


80 . 

60 

40 

20 ... 

0 

20 ... 

40 

60 ... 

-80 



-100 


-2000 




fh 






3min 




_g_ 20min 


-1500 -1000 


-500 


500 1000 


Time (s) 

FIG. 3. Sensorgram showing the binding of N protein to CD-61 to investigate whether any secondary binding occurred, 6.25 nM nonphos- 
phorylated N protein was injected for 1, 3, or 20 min. RU, response units. 


using mass spectroscopy, we investigated the posttranslational 
modifications that occurred on N protein produced in Vero 
cells to N protein produced in Sf9 cells. If posttranslational 
modifications of N protein in Sf9 cells were the same as N 
protein synthesized in Vero cells, then we would use N protein 
produced by the former method as our source of phosphory- 
lated protein for surface plasmon resonance, as larger amounts 
of protein can be obtained for analysis. N protein produced in 
and purified fromE. coli, Sf9 cells, and Vero cells was analyzed 
by dynamic light scattering (data not shown), which indicated 
that the protein was monodisperse and had a hydrodynamic 
radius (whatever the source) of approximately 10 nm. 

IBV N protein contains internal disulfide bonds. IBV N pro¬ 
tein expressed in E. coli, Sf9 cells, and Vero cells was subjected 
to proteolytic digestion and analysis by capillary HPLC-MS. A 
representative HPLC chromatogram from analysis of peptides 
released by tryptic digestion of N E coli is shown in Fig. 5. Many 
proteolytic peptides were identified, which covered a large 
amount of the IBV-N protein sequence; peaks are labeled with 
the residue numbers that the peptides span. 

Comparison of peptide profiles before and after reduction 
with dithiothreitol allowed us to identify various peptides that 
contained disulfide bonds. Based on the masses of these pep¬ 
tides, we tentatively assigned disulfide bonds between residues 
Cys281 to Cys308 and Cys320 to Cys323. To confirm these 
assignments, we also desalted a portion of the peptide mixture 
by use of a ZipTip and subjected these peptides to tandem MS 
analysis by using collision activation. Figure 6 shows the tan¬ 
dem mass spectrum of the peptide believed to contain residues 
318 to 331, with an internal disulfide bond between residues 
Cys320 and Cys323. The fragment ions are labeled according 
to the convention of Roepstorff and Fohlman (50), and this 


spectrum confirms the identity of the peptide. After reduction, 
the mass of this peptide increased by 2 Da, and this confirmed 
the presence of the disulfide bond. Likewise, we obtained MS/ 
MS data that confirmed our assignment of the disulfide bond 
between Cys281 and Cys308 (data not shown). The presence of 
these two disulfide bonds was also confirmed in N s£9 by similar 
methods (data not shown). A range of peptides covering over 
65% of the sequence of N E coli was identified, but we found no 


Ml 2 3 4 



FIG. 4. Western blot analysis of IBV N protein in infected cells 
(lane 1), cells transfected with a plasmid expressing N protein (lane 2), 
mock-infected cells (lane 3), and mock-transfected cells (lane 4). The 
appropriate molecular weight markers are indicated to the left of their 
lane (M). The position of the IBV N protein is indicated by an arrow. 
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FIG. 5. Capillary HPLC-MS analysis of peptides released after tryptic digestion of N E co]i . Peptides were detected by both UV absorbance at 
214 nm (upper trace) and mass spectrometric total ion current (TIC, lower trace). Peaks in the upper trace are labeled with the retention time 
of the peptides, and those in the lower trace are labeled according to the residue numbers of N E coli tryptic peptides that were identified. Peaks 
marked with asterisks correspond to trypsin autolysis products. 


evidence for further posttranslational modifications in this pro¬ 
tein. 

N sf9 contains additional posttranslational modifications. 

Comparison of digests of N E coM with N sf9 allowed us to in¬ 
vestigate posttranslational modifications that are specific to 
N sf9 . Partial digestion of N E coli with endoproteinase Glu-C 
produced two N-terminal fragments corresponding to residues 
1 to 182 and 1 to 220, which had measured masses close to their 
theoretical masses, implying that no modifications had oc¬ 
curred within this region (Fig. 7a). The corresponding polypep¬ 
tides from N ss had measured masses that were different from 


those predicted from the sequence. The measured mass of res¬ 
idues 1 to 182 was 19,574 Da, around 44 Da higher than the 
measured mass of N E coli protein residues 1 to 182. In addition, 
N s(9 protein residues 1 to 220 were present in three different 
forms; the lowest-mass form had a measured mass that was, 
again, 44 Da higher than that of N E coli residues 1 to 220. The 
other species had measured masses that were a further 81 and 
157 Da higher (Fig. 7b). These mass shifts occurred in the 
polypeptide spanning residues 1 to 220 but not residues 1 to 
182, suggesting that the modifications were within residues 183 
to 220. 



FIG. 6. Tandem mass spectrum of a putative disulfide bond-containing peptide. The peptide covers residues Ile318 to Lys331 and was released 
by tryptic digestion of N E coli . The presence of both diagnostic b and y series fragment ions confirm the identity of this peptide. The mass of the 
peptide was observed to increase by 2 Da after reduction of disulfide bonds with dithiothreitol, confirming the existence of the single disulfide bond 
within this peptide. 


Downloaded from http://jvi.asm.org/ on April 4, 2014 by guest 





























































1170 


CHEN ET AL. 


J. Virol. 



FIG. 7. Deconvoluted electrospray mass spectra of polypeptides 
released by endoproteinase Glu-C digestion of N E coli (a) and N sf9 (b) 
proteins, (a) Peaks corresponding to residues 1 to 182 (theoretical 
mass, 19,531.7 Da) and 1 to 220 (theoretical mass, 23,641.3 Da) are 
evident, (b) Peaks corresponding to the same polypeptides have 
masses that have increased by around 40 Da. This mass difference was 
measured to be 44 Da from the original peak envelopes. In addition, 
the polypeptide spanning residues 1 to 220 is present in three different 
forms, with each separated by around 80 Da. This is consistent with up 
to two phosphorylations in this polypeptide. The absence of these 
signals in the peptide corresponding to residues 1 to 182 implies that 
the sites of phosphorylation are between residues 183 and 220. 


The modification of 44 Da within residues 1 to 182 was 
consistent with acetylation (addition of 42 Da), and a proteo¬ 
lytic peptide spanning residues 1 to 51 also carried this modi¬ 
fication (measured difference, 42.6 Da) (data not shown). The 
measured mass differences occurring between residues 183 to 
220 were consistent with the addition of two phosphate groups 
(phosphorylation results in a mass increase of 80 Da per group). 
Endoproteinase Glu-C digestion also generated C-terminal 
peptides with different masses between N E coli and N sf9 pro¬ 
teins, suggesting additional phosphorylation sites within this 
domain (data not shown). 

Residues in both the N- and C-terminal domains are phos- 
phorylated in both N sf9 and N Vero . To locate the sites of phos¬ 
phorylation, N s(9 was digested with trypsin to yield smaller 
fragments; the peptide pool was desalted by use of a ZipTip 
and was analyzed by offline MS in negative-ion mode. To spe¬ 
cifically detect phosphopeptides, a precursor ion scan was used 
that detected peptides fragmenting to yield a moiety of an m/z 
of 79 (P0 3 ~). The resulting spectrum is shown in Fig. 8, and 
peaks are labeled with putative residue numbers of the pep¬ 


tides, assigned solely from the peptide mass. From these data, 
it appears that there are two sites of phosphorylation between 
residues 186 to 198 and a further two between residues 367 to 
394. A fresh aliquot of tryptic peptide mixture was analyzed in 
positive-ion mode, and the relevant phosphopeptides were 
subjected to collision activation experiments by MS/MS. The 
MS/MS spectrum of the doubly phosphorylated peptide with a 
mass of 3,460.4 Da from N s(g is shown in Fig. 9a. Fragment 
ions are labeled according to the convention of Roepstorff and 
Fohlman (50). Neutral losses of 98 Da (H 2 P0 4 ) from the 
parent ion, characteristic of phosphopeptides, was observed. In 
addition, structural information was obtained from sequence- 
specific fragment ions, which are shown more clearly in the 
magnified inset of Fig. 9a. By comparison with corresponding 
MS/MS spectra of singly and unphosphorylated peptides, the 
sites of phosphorylation were distinguished. The data showed 
that the peptide spans residues 367 to 394 and that the sites of 
phosphorylation are Thr378 and Ser379. In the singly phos¬ 
phorylated peptide, however, it is not clear which of these two 
sites is occupied. 

MS/MS spectra were also obtained on putative phospho¬ 
peptides from the N-terminal region of N sf9 (data not shown), 
and these confirmed that the sites of phosphorylation are 
Serl90 and Serl92. The MS/MS spectrum of the nonphosphor- 
ylated peptide confirmed the identity of the peptide, whereas 
that of the singly phosphorylated peptide demonstrated that 
Serl92 was occupied in preference to Serl90. 

To confirm that these phosphorylation sites were present in 
N protein expressed by mammalian cells, the protein was ex¬ 
pressed in Vero cells (N Vero ), digested with trypsin, and ana¬ 
lyzed by HPLC-MS. Since the amount of protein obtained 
from these cells was low, this analysis was performed by use of 
a more-sensitive mass spectrometer, a Q-ToF instrument. The 
tryptic digest of N Vero was similar to that of N s(9 , and putative 
phosphopeptides were observed with the same masses as mea¬ 
sured for N s(9 . Operating in data-dependent acquisition mode 
allowed peptides to be automatically subjected to MS/MS anal¬ 
ysis as they eluted from the column, and this included putative 
phosphopeptides. The MS/MS spectrum of the singly phos¬ 
phorylated peptide spanning residues 188 to 198 from N Vero is 
shown in Fig. 9b. It is identical to that obtained from N sf9 (data 
not shown) and confirms that the initial site of phosphorylation 
in the N-terminal region is Serl92. We also obtained MS/MS 
spectra for the C-terminal phosphorylation sites. Our data 
indicated that the same phosphorylation sites were occupied in 
N Ver o as were found in N sf9 , establishing that N s(9 is a relevant 
model for phosphorylated N protein and thus could be used for 
kinetic analysis. N protein produced in E. coli is subsequently 
referred to as N nonphos , and N protein produced in Sf9 cells is 
referred to as N phos . 

Comparison of the affinity of binding of nonphosphorylated 
and phosphorylated forms of N protein with viral RNA by 
using surface plasmon resonance. We compared the binding 
of N nonphos protein and N phos protein with two models of the 
IBV genome. The first was RNA synthesized by in vitro tran¬ 
scription from pCD-61. The second was a synthetic biotinyl¬ 
ated RNA, RNAmer 3LeaderTAS, which was identical to the 
5' end of IBV mRNA 3 up to, and including, the translation 
initiation codon for gene 3a. CD-61 RNA was generated by 
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FIG. 8. Negative-ion mass spectrometric analysis of tryptic peptides of N ss by offline precursor ion scanning. The mass spectrum shows all ions 
that fragment to give a specific fragment ion with an m/z of 79 (P0 3 ~). Peaks are labeled with a letter to identify the peptide and a number 
corresponding to the charge state. Although many signals are evident, this is the result of heterogeneity in digestion and phosphorylation. All 
peptides span essentially the same residues in the N- and C-terminal domains, and these data suggest up to two phosphorylations between residues 
186 and 198 and an additional two between residues 367 and 394. 


runoff transcription from pCD-61 and hybridized to oligo(dT), 
which was immobilized on a streptavidin flow cell. 

Various concentrations of either N nonphos or N phos proteins 
were passed over immobilized CD-61 RNA. The resultant sen- 
sorgrams are shown in Fig. 10, and kinetic analysis is shown in 
Table 3. The data suggested that there was no significant dif¬ 
ference in the binding of N nonphos protein (K d , 8.89 ± -0.08 
nM) to CD-61 than N phos protein ( K d , 16.9 ± —0.53 nM), 
which indicated that both forms of N protein associated with 
viral RNA with high affinity. The preparation of the CD-61- 
oligo(dT) complex may also have contained free oligo(dT), 
which was available to bind to the streptavidin flow cell. How¬ 
ever, we found that N protein did not bind to immobilized 
oligo(dT) (data not shown), suggesting that any binding char¬ 
acteristics of the CD-61-oligo(dT) complex were due to inter¬ 
actions with CD-61 RNA only. 

Biotinylated RNAmer 3LeaderTAS was immobilized di¬ 
rectly onto the streptavidin flow cell, 12.5 nM N nonphos protein 
or N phos protein was passed over, and the binding affinity was 
determined. The sensorgrams for 3LeaderTAS with N phos and 
Nnonphos proteins are shown in Fig. 11 and 12, respectively. The 
kinetic analysis of the binding experiments is shown in Table 
4. On first inspection, the data indicated that N nonp hos pro- 
tein ( K d , 0.657 nM) had a higher affinity for 3LeaderTAS than 
N p hos protein ( K d , 2.82 nM). However, we found that the vari¬ 
ation in binding affinities between replicate experiments with 
RNAmers was approximately ±1.5 nM (data not shown), and 
therefore, the binding of N nonphos protein and N phos protein 
with 3LeaderTAS was not significantly different. Although 
these binding affinities were higher than with CD-61, they both 
reflected high-affinity binding of N protein with viral RNA. 

Phosphorylation decreases the affinity of N protein for ran¬ 
dom RNA. As a model for nonviral RNA, we synthesized a 
biotinylated RNAmer (Random90) of identical length to 
RNAmer 3LeaderTAS that was based on the E. coli sequence, 


but the GC content was reduced to reflect a GC ratio (28%) 
similar to that of the leader RNA. The biotinylated RNAmer 
was immobilized to the streptavidin flow cell, and then 12.5 nM 
concentrations of both forms of N protein were injected for 
360 s. The sensorgrams for RNAmer Random90 are shown in 
Fig. 11 and 12. The kinetic analysis of the binding experiments 
is shown in Table 4. The binding affinity of N phos protein to 
RNAmer Random90 was 810 nM compared to 3.24 nM for 
Nnonphos protein, a 250-fold difference. This was probably be¬ 
cause the association rate of N phos protein to RNAmer 
Random90 is approximately 170 times lower than that of 
Nnonphos protein. The dissociation rates for both proteins are 
approximately the same. These data indicate that phosphor¬ 
ylation of N protein decreases the affinity for random RNA. 
A side-by-side comparison of the binding of both species of 
N protein with 3LeaderTAS and Random90 is also shown in 
Fig. 12. 

Core sequence determines association rate. We investigated 
the effect of the core sequence on RNA binding by using two 
different approaches: by scrambling the core sequence from 
gene 3 (RNAmer 3LeaderSAT) and by replacing eight nucle¬ 
otides of the Random90 sequence with the gene 3 core site 
(RNAmer Random90TAS) in the same location as the viral 
RNA, i.e., at nucleotide 57 of the leader sequence. The muta¬ 
tions introduced into the gene 3 core sequences in RNAmer 
3LeaderSAT were the same mutations that abolished tran¬ 
scription in an IBV minireplicon (65) and were thus biologi¬ 
cally relevant. Both biotinylated RNAmers were immobilized 
to the streptavidin flow cell, and then 12.5 nM N phos protein 
was passed over these ligands for 360 s. The sensorgrams for 
3LeaderSAT and Random90TAS are shown in Fig. 11. The 
kinetic analysis of the binding experiments is shown in Table 4. 
Abolition of the core sequence did not appear to significantly 
affect the affinity of binding of N phos to the IBV gene 3 leader 
sequence. However, the addition of the core site to Random90 
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FIG. 9. Tandem mass spectra of phosphopeptides released after tryptic digestion of N sf9 (a) and N Vero (b). The parent ion in panel a had an 
m/z of 866, corresponding to the putative diphosphorylated peptide of residues 367 to 394 with four charges. Characteristic neutral losses of 98 
Da, m/z of 841.86 and 817.45, can be observed. The inset in panel a is a magnification of the region of the spectrum containing sequence-specific 
fragment ions and confirms the identity of the peptide and the sites of phosphorylation as Thr378 and Ser379. The parent ion in panel b had an 
m/z of 642.8, corresponding to the peptide of residues 188 to 198 with a single phosphorylation. A single neutral loss to an m/z of 593.8 is evident. 
The sequence-specific fragment ions confirm the identity of the peptide and conclusively indicate the site of phosphorylation as Serl90. In all cases, 
peaks are labeled according to the fragment ion nomenclature of Roepstorff and Fohlman (50). 


increased the affinity of binding of N phos protein to RNA 
approximately 240 times, mainly due to the increased associa¬ 
tion rate, indicating that the core site affected the association 
rate between N phos protein and RNA but not the dissociation 
rate. The same trend was observed for the binding of N nonphos 
protein to these RNAmers (Table 4; Fig. 11), and the differ¬ 
ence in binding affinity by N nonphos protein and the different 
RNAmers was less than that for the N phos protein. 

Multiple N protein binding sites are present in leader se¬ 
quence. The above data indicated that, although the TAS acted 
to promote high-affinity binding of N protein on a nonviral 
RNA, when the site was mutated in the leader sequence, there 
was no significant difference to binding, suggesting that other 
binding sites may be present. Analysis of the structure of the 
IBV leader sequence indicated that two predicted stem-loops 


were formed and that the sequences corresponding to these 
predicted structures were conserved in different strains (66). 
Where nucleotide differences occurred between strains, a cor¬ 
responding covariant mutation was present to conserve the 
appropriate predicted stem-loop structure (66). Predicted 
stem-loop one (SL1) is formed between nucleotides 7 and 30, 
and predicted stem-loop two (SL2) is formed between nucle¬ 
otides 34 and 48. Although a third stem-loop is formed when 
predicting the structure of the leader sequence, this feature is 
not present when the analysis is extended in the 3' direction (P. 
Britton, personal communication). Therefore, we investigated 
the role of the two conserved predicted stem-loop regions up¬ 
stream of the core element in binding phosphorylated N pro¬ 
tein. Appropriate regions of the leader sequence were replaced 
with the corresponding nucleotides from the random nonviral 
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FIG. 10. Sensorgrams showing the binding of different concentra¬ 
tions (indicated above the appropriate nanomolar concentration) of 
either phosphorylated (A) or nonphosphorylated (B) N protein to 
CD-61 RNA by using surface plasmon resonance. RU, response units. 


RNA. All subsequent experiments were conducted with N phos 
protein, as this protein could distinguish between nonviral and 
viral RNA and probably represents the predominant form of N 
protein in the infected cell and virion (31). 

Replacement of sequence downstream of the TAS with ran¬ 
dom RNA (RNAmer SLlSL2TASrand) did not affect the bind¬ 
ing affinity of N protein when compared to 3LeaderTAS ( K d , 
2.59 and 2.82, respectively) (Table 4; Fig. 11). The association 
rate was three times slower. The binding affinity of N phos pro¬ 
tein on SLlSL2rand was also less (K d , 11.7) when compared to 
SLlSL2TASrand (K d , 2.59), which is a reflection of the lower 
association rate, again indicating that the TAS contributes to 
association rate (e.g., Random90TAS has the fastest associa¬ 
tion rate for the binding of N phos protein). 

SL2 can act as a high-affinity N protein binding site. Re¬ 
placement of SL1 with random RNA (RNAmer DSL1SL2TAS) 
did not appear to significantly affect the binding affinity ( K d , 
2.42) when compared to the leader RNA (K d , 2.82), although 
the disassociation rate was slower (Table 4; Fig. 11). The asso¬ 
ciation rate of N phos protein with RNA was lower with DSL1SL2 
compared to the leader RNA, and the binding affinity was 
approximately five times less. Interestingly, the data indicated 
that SL2 by itself could act as a high-affinity binding region for 


N protein, as replacement of flanking sequences with random 
RNA gave a binding affinity with a K d of 1.51. 

Role of SL1 in N protein binding. SL2 and sequences down¬ 
stream of the TAS were replaced with random RNA, forming 
RNAmer SLITASrand, so the RNA contained only SL1 and a 
TAS. N phos protein bound to this RNA with higher affinity ( K d , 
0.63) than to the leader RNA (Table 4; Fig. 11). Unfortunately, 
an RNAmer containing SL1 only proved difficult to synthesize, 
and thus, we were unable to determine the binding affinity of 
Npho S protein to SL1. However, the binding affinity of N phos 
protein to SLITASrand is greater than to RandomTAS, sug¬ 
gesting that SL1 can promote high-affinity binding. 

Binding of N phos protein to antileader RNA. In coronavirus- 
infected cells, negative-sense copies of sgRNAs are present 
and contain antileaders (60). These negative-strand RNAs can 
act as templates for the synthesis of positive-strand sgRNAs 
(24, 54-57, 59). We investigated the binding of N phos protein to 
antileader RNA. This RNAmer was the exact anticopy of 
3LeaderTAS, except that it was one nucleotide shorter because 
the synthesis process could not terminate with a U (Invitro- 
gen). The kinetic analysis (Table 4; Fig. 11) indicated that 
Nphos protein formed a high-affinity association with antileader 
(2.17 nM). 

DISCUSSION 

The RNA binding kinetics of the coronavirus nucleoprotein 
to RNA and relevant binding motif(s) and the role of phos¬ 
phorylation in this process has not previously been determined. 
To model this, we made use of IBV N protein purified from 
either E. coli or Sf9 cells to generate nonphosphorylated and 
phosphorylated proteins, respectively, which we have used to 
study interactions with nucleolin (13). Mass spectroscopy con¬ 
firmed that N protein purified from E. coli was nonphosphor¬ 
ylated, whereas N protein purified from Sf9 cells was phos¬ 
phorylated. These phosphorylation sites were identical to the 
phosphorylation sites found on N protein expressed in Vero 
cells, which is the mammalian cell line model for IBV. The 
data suggested that IBV N protein could be phosphorylated at 
two clusters, amino acid residues Serl90 and Serl92 and 
Thr378 and Ser379, which are proximal to RNA binding do¬ 
mains. The majority of species was either mono- or biphos- 
phorylated. These sites are predicted casein kinase II (CKII) 


TABLE 3. Average kinetic values for complexes of phosphorylated 
and nonphosphorylated N protein with CD-61° 


Analyte 
concn (nM) 

k a (1/ms) 

k d 

(1/s) 


K a (1/M) 

K d (nM) 

12.50 

3.21 

X 

10 4 

3.25 

X 

10- 

-4 

9.89 

X 

10 7 

10.10 


3.45 

X 

10 4 

2.1 

X 

10- 

-4 

1.64 

X 

10 s 

6.08 

9.375 

2.04 

X 

10 4 

3.74 

X 

10- 

-4 

5.46 

X 

10 7 

18.30 


4.31 

X 

10 4 

2.22 

X 

10' 

-4 

1.94 

X 

10 s 

5.16 

6.25 

1.78 

X 

10 4 

4.49 

X 

10- 

-4 

3.96 

X 

10 7 

25.30 


4.26 

X 

10 4 

2.86 

X 

10- 

-4 

1.49 

X 

10 s 

6.72 

4.69 

2.44 

X 

10 4 

4.72 

X 

10- 

-4 

5.16 

X 

10 7 

19.40 


3.34 

X 

10 4 

3.4 

X 

10" 

-4 

9.81 

X 

10 7 

10.20 

3.13 

3.19 

X 

10 4 

3.71 

X 

10- 

-4 

8.61 

X 

10 7 

11.60 


2.82 

X 

10 4 

4.61 

X 

10- 

-4 

6.12 

X 

10 7 

16.30 


a For each analyte concentration, the result for phosphorylated N protein is 
given above the result for nonphosphorylated N protein. 
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FIG. 11. Sensorgrams showing the binding of phosphorylated N protein to various biotinylated RNAmers (indicated) by using surface plasmon 
resonance. RU, response units. The sensorgram for 3LeaderTAS (indicated by an asterisk) is superimposed for reference. 


phosphorylation sites (32). Additional acidic amino acids in 
positions +1, +2, +4, and +5 have been reported to increase 
the phosphorylation rate at CKII sites, whereas a basic residue 
N-terminal of the site will decrease the phosphorylation rate. 
In the case of Serl90, there are acidic amino acids in the +1 
(Aspl91), +4 (Aspl93), and +5 (Aspl94) positions (Fig. 13). 
However, there are two basic residues N-terminal of this site 
(Argl88 and Argl89), which may account for the observation 
that Serl92 was occupied in preference to Serl90. In the case 
of Serl92, there are acidic amino acids in the +2 (Aspl93) and 
+3 (Aspl94) positions (Fig. 13). For the C-terminal phosphor¬ 
ylation sites, Thr378 has preferential acidic amino acids in the 
+2 (Asp380) and +4 positions (Asp382) and Ser379 has pref¬ 
erential acidic amino acids in the +1 (Asp380) and +2 


(Asp381) positions (Fig. 13). CKII is present in the mitochon¬ 
dria and cytoplasm, but predominately in the nucleus, and 
several nuclear proteins serve as substrates, for example, p53, 
RNA polymerases I and II, and the nucleolar proteins B23 and 
C23 (32). Phosphorylation by CKII alters the nucleic acid or 
catalytic activity of some of these substrates. Coronavirus N 
proteins have both a cytoplasmic and nuclear/nucleolar distri¬ 
bution (25, 42, 81), as do other N proteins of the Nidovirales 
(51, 52, 74). Therefore, it is interesting to speculate that coro¬ 
navirus N proteins can be posttranslationally modified by 
nuclear proteins (e.g., phosphorylation by CKII), suggesting 
nuclear involvement in positive-strand RNA virus replica¬ 
tion (22, 23, 77). CKII has also been shown to be involved in 
the phosphorylation and transcriptional activity of a number 


Downloaded from http://jvi.asm.org/ on April 4, 2014 by guest 




















Vol. 79, 2005 


RNA BINDING PROPERTIES OF CORONA VIRUS NUCLEOPROTEIN 


1175 


DSL1SL2rand SLITASrand 



Anti-leader 



of viral proteins, including vesicular stomatitis virus P pro¬ 
tein (3). 

HPLC analysis of tryptic phosphopeptides revealed that phos¬ 
phorylation of MHV N protein occurred at only two or three 
sites (79), which Laude and Masters (31) suggested was rela¬ 
tively small, given that MHV N protein had some 30 to 40 
potential target residues for phosphorylation. Sequence anal¬ 
ysis indicated that IBV had at least 10 potential residues for 
phosphorylation (53); however, our mass spectroscopy results 
for IBV N protein would support the hypothesis that relatively 
few potential sites of phosphorylation in the coronavirus N 
proteins are phosphorylated. 

Although our principal aim with mass spectroscopy was to 
investigate the phosphorylation status of N protein, analysis 
revealed several other potential modifications. We found that 
the N-terminal region of N protein may be acetylated. Acety¬ 
lation in the N-terminal region of both cellular and viral pro¬ 
teins is a common posttranslational modification, e.g., the 
phosphoprotein (P) of some paramyxoviruses (62). Although 
acetylation of proteins has a number of different functions, it is 
also involved in transcriptional regulation, e.g., acetylation of 
human immunodeficiency virus Tat protein (43) and CREB- 
mediated gene expression (33). Acetylation of the coronavirus 
N protein could therefore play a role in the transcriptional 
regulatory activity the protein may display. N protein from the 
three different sources had identical disulfide bridges, formed 
between residues Cys281 and Cys308 and Cys320 and Cys323. 
Analysis of N protein by sodium dodecyl sulfate-polyacrylam¬ 
ide gel electrophoresis analysis under native conditions indi¬ 
cated that the predominant species in solution were both mo¬ 
nomeric and multimeric (but not dimeric), whereas under 
reducing conditions, N protein was present as a monomer only 
(12). Although these bridges may have formed during the 


extraction and purification of the protein, normally when this 
occurs, disulfide bridge formation occurs sequentially, and there¬ 
fore, we would have predicted that disulfide bridges would 
form between Cys228 and Cys281 and Cys308 and Cys320. 
However, we did not observe this. Although N protein is cy¬ 
tosolic, non-endoplasmic reticulum-based redox pathways can 
contribute to disulfide bridge formation (61). Disulfide bridge 
formation may therefore be important in the correct folding of 
N protein. However, it should be noted that the severe acute 
respiratory syndrome coronavirus N protein does not contain 
any cysteine residues and therefore would not contain disulfide 
bridges (34). 

Surface plasmon resonance (Biacore) has not previously 
been applied to the study of the interaction of coronavirus N 
protein with coronavirus RNA. Therefore, we used this tech¬ 
nique to investigate the interaction of nonphosphorylated and 
phosphorylated IBV N protein with a variety of viral and non- 
viral target RNAs. In the case of the biotinylated RNAmers, 
we were able to examine the interaction of N protein with 
target RNAs at near physiological temperature (37°C). We 
examined the binding of nonphosphorylated and phosphory¬ 
lated N protein on both the gene 3 leader sequence and CD-61. 
There was no significant difference in the binding of nonphos¬ 
phorylated and phosphorylated N protein to leader RNA or a 
defective interfering RNA (DI-RNA). Our results are in con¬ 
trast to those of Cologna et al. (14), who found that the bovine 
coronavirus (BCoV) N protein bound more efficiently to a 
BCoV minireplicon than to the BCoV leader sequence. This 
discrepancy may be due to the differences between filter bind¬ 
ing assays used by Cologna et al. (14) and surface plasmon 
resonance (45) used in our study, or it may be an intrinsic 
property of the different coronavirus N proteins. However, our 
results suggested that phosphorylated N protein had 264 times 
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FIG. 12. Sensorgrams showing the binding of nonphosphorylated N protein to various biotinylated RNAmers (indicated) by using surface 
plasmon resonance. RU, response units. The sensorgram for 3LeaderTAS (indicated by an asterisk) is superimposed for reference. The bottom sen- 
sorgram is for comparison of the binding of nonphosphorylated (N nonphos ) and phosphorylated (N phos ) N protein to both 3LeaderTAS and Random90. 


greater affinity for the leader sequence than a nonviral RNA. N 
protein failed to bind to an oligomer with 25 T residues, indi¬ 
cating that sequence context may be important for nucleotide 
binding. The differences in the amount of N binding to RNA, the 


TABLE 4. Average kinetic values for complexes of phosphorylated 
and nonphosphorylated N protein with target RNAmers" 


Target RNA 

k a (1/ms) 

k d (1/s) 

K a (1/M) 

K d (nM) 

3LeaderTAS 

7.81 X 10 4 

2.20 X 

10~ 4 

3.55 X 10 s 

2.82 


1.58 x 10 5 

1.04 X 

10~ 4 

1.52 X 10 9 

0.657 

3LeaderSAT 

6.38 X 10 4 

2.36 X 

10~ 4 

2.7 x 10 s 

3.70 


4.48 X 10 4 

7.24 X 

10~ 5 

6.19 X 10 s 

1.62 

Random90 

606 

4.9 X 

10~ 4 

1.24 X 10 6 

810 


1.02 X 10 s 

3.30 x 

10~ 4 

3.08 X 10 s 

3.24 

Random90TAS 

1.12 X 10 5 

3.82 x 

10~ 4 

2.94 X 10 s 

3.40 


1.59 X 10 s 

1.70 x 

10~ 4 

9.34 X 10 s 

1.07 

SLlSL2TASrand 

2.36 X 10 4 

6.11 x 

10~ 5 

3.86 X 10 s 

2.59 

SLlSL2rand 

4.73 X 10 3 

5.53 x 

10~ 5 

8.56 x 10 7 

11.7 

DSL1SL2TAS 

3.83 X 10 4 

9.26 x 

10~ 5 

4.13 X 10 s 

2.42 

DSL1SL2 

7.41 X 10 3 

1.12 x 

10~ 4 

6.62 x 10 7 

15.10 

DSLlSL2rand 

2.94 X 10 4 

4.44 X 

10” 5 

6.61 x 10 s 

1.51 

SLITASrand 

4.21 X 10 4 

2.65 X 

10- 5 

1.59 x 10 9 

0.63 

Antileader 

5.73 X 10 4 

1.24 X 

10~ 4 

4.62 x 10 s 

2.17 


" Where two values are given for a target RNA, the result for phosphorylated 
N protein is given above the result for nonphosphorylated N protein. 


source of N protein, and the kinetics of this interaction may 
account for why previous studies could not resolve whether N 
bound either specifically to viral RNA or bound with equal 
affinity to both viral and nonviral RNA. 

Nelson et al. (41) measured the binding affinity of MHV N 
protein (isolated from E. coli and therefore nonphosphoryiat- 


1MASGKAAGKT DAPAPVIKLG GPKPPKVGSS GNASWFQAIK AKKLNTPPPK 
51FEGSGVPDNE NIKPSQQHGY WRRQARFKPG KGGKKPVPDA WYFYYTGTGP 
101AADLNWGDTQ DGIVWVAAKG ADTKSRSNQG TRDPDKFDQY PLRFSDGGPD 
151GNFRWDFIPL NRGRSGRSTA ASSAAASRAP SREGSRGRRS DSGDDLIARA 


(kk 


201AKIIQDQQKK GSRITKAKAD EMAHRRYCKR TIPPNYRVDQ VFGPRTKGKE 
251GNFGDDKMNE EGIKDGRVTA MLNLVPSSHA CLFGSRVTPK LQLDGLHLRF 
301EFTTWpiDD PQFDNYVKIC DQCVDGVGTR PKDDEPKPKS RSSSRPATRG 
351NS PAPRQQRP KKEKKLKKQD DEADKALTSD EERNNAQLEF YDEPKVINWG 
4 01DAALGENEL* 


(pOk 


FIG. 13. Amino acid sequence of the IBV Beaudette strain N 
protein (5) showing the sites of phosphorylation and disulfide 
bridge formation. 
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ed) to the MHV leader sequence and obtained a K d of 14 nM. 
Although the binding affinity of IBV N nonphos protein to the 
IBV leader ( K d , 0.66 nM) is greater than that of MHV N 
protein to MHV leader, this may be a reflection of the differ¬ 
ences in recombinant N protein preparation (His tag in the 
case of IBV N protein versus glutathione 5-transferase fusion 
for MHV N protein) and kinetic measurement (surface plas- 
mon resonance versus filter binding assay). With that said, the 
binding affinity of MHV N protein to MHV leader (41) and 
IBV N protein to IBV leader is very strong and thus probably 
reflects a general pattern for coronaviruses. Consideration 
should be given to the fact that N protein in virions and the 
infected cell is exclusively phosphorylated and is probably not 
nonphosphorylated (31). A phosphoamino acid analysis of the 
arterivirus (a member of the Nidovirales) porcine reproductive 
and respiratory syndrome virus nucleocapsid protein revealed 
that this protein was also exclusively phosphorylated, and no 
nonphosphorylated species were found (80). Therefore, care 
should be taken when interpreting the results obtained with 
nonphosphorylated N protein and drawing biologically rele¬ 
vant conclusions, as this form of N protein is probably not 
present in virus-infected cells. We favor the hypothesis that the 
role of phosphorylation is to decrease the affinity of N protein 
for random RNA so that N protein associates with viral RNA 
in preference to cellular RNA. 

Comparison of the binding affinity between IBV N protein 
and RNA and other RNA binding proteins also revealed that 
N protein had high-affinity binding. Using surface plasmon 
resonance, Park et al. (45) determined the binding affinity for 
human neuron-specific RNA binding (HuD) protein with AU- 
rich target biotinylated RNAs was approximately 0.7 nM. In 
this experiment, recombinant HuD protein was C-terminally 
His tagged and purified from E. coli and presumably nonphos¬ 
phorylated. The binding affinity of IBV N nonphos and N phos 
protein to leader RNA is similar. 

The principal function of N protein in terms of the infectious 
cycle of the virus can perhaps be viewed as binding to viral 
RNA to form the ribonucleocapsid structure, part of the core 
particle. In a number of viruses, viral capsids can form in the 
absence of RNA and instead can be replaced with either cel¬ 
lular RNAs or suitable polyanions; this has been perhaps best 
characterized in the encapsidation and assembly of nodavirus 
particles (29, 58, 73). The high affinity of phosphorylated N 
protein for viral RNA over nonviral RNA together with ap¬ 
propriate packaging signals (16, 18, 20, 37) may explain why 
virions are composed of viral RNA (48). 

Using the phosphorylated form of N protein, we found that 
the IBV gene 3 core sequence contributed to the kinetics of 
RNA binding. Introducing a core sequence into nonviral RNA 
(Random90) increased N binding, with the major change being 
in the association rate. Scrambling the core sequence in the 
context of the leader RNA resulted in only a small decrease in 
binding affinity, suggesting that the core sequence was not the 
only N protein-binding site present on leader RNA. Stirrups 
et al. (66) described two conserved predicted stem-loops that 
were present upstream of the core sequence in the leader se¬ 
quence. By replacing these sequences with random RNA (from 
Random90), that did not promote high-affinity binding, we 
showed that the sequences that form predicted SL1 and pre¬ 
dicted SL2 both contributed to high-affinity binding of phos¬ 


phorylated N protein to leader RNA. However, it should be 
noted that optimum RNA ligand binding sequences are not 
necessarily the same as those found in vivo (78). The high- 
affinity binding of N protein to the stem-loop structures may 
suggest that N protein could be a general double-stranded RNA 
binding protein. However, structural analysis of RNAmer 
Random90 indicated that this molecule formed stem-loop 
structures (data not shown), and in the case of the phosphor¬ 
ylated N protein, we observed low-affinity binding with this 
target compared to viral RNA, suggesting that, in general, N 
protein did not specifically recognize double-stranded RNA. 

Such stem-loop structures have also been identified in the 5' 
end of several other coronaviruses, including BCoV DI-RNA 
(10, 11) and MHV (75), and these sites may therefore also act 
as high-affinity sites for N protein. However, with that said, in 
their kinetic analysis, Stohlman et al. (68) located only one 
high-affinity binding site on the MHV leader RNA that bound 
MHV N protein, and this site encompassed the MHV core 
element. But discrepancies between our findings with IBV 
leader sequence and IBV N protein and the MHV leader 
sequence and MHV N protein, may be a reflection of the 
inherent properties of the molecules themselves or of the dif¬ 
ferent kinetic analysis used. 

The high-affinity association of phosphorylated N protein 
with the leader sequence and TAS may have implications 
for viral translation and possibly replication. If coronavirus 
mRNAs are translated in a cap-dependent manner, in that they 
initiate translation at the 5' end of the leader sequence, then 
the ribosome may be in conflict with the N protein when it 
scans along the leader sequence for the translation start codon. 
However, Tahara et al. (70) have speculated that N protein 
enhances the translation of MHV mRNAs. In this case, one 
may hypothesize that by binding to the leader RNA, N protein 
may reduce the secondary structure to increase the efficiency of 
translation. 

N protein may also bind, with high affinity, to the TAS 
elements located along the coronavirus genome, and one may 
therefore speculate that this interaction is involved in the ef¬ 
ficiency of transcription of (minus) sgRNAs. Also, N protein 
may bind to sites that are very similar in sequence to the core 
element (44) to facilitate transcription at these sites. Although 
the caveat is that N protein is not required to rescue all of the 
full-length coronavirus clones so far described, the presence of 
N protein does increase the rescue efficiency. An alternative is 
that the TAS sites act as nucleation sites for N protein to 
facilitate the encapsidation of virus RNAs. Such binding sites 
may act separately from the packaging signal that promotes a 
specific interaction with the N protein and the genomic RNA 
(18, 37) and presumably is required for the efficient packaging 
of the genomic RNA, but not sgRNAs, into core particles. 
MHV genomic RNA has been reported to be incorporated 
into the virion in the absence of N protein (38). 

In their review, Laude and Masters (31) described how sev¬ 
eral workers speculated that phosphorylation of N protein may 
determine its binding affinity with RNA. Our data indicated 
that phosphorylation of N protein decreased the affinity of N 
protein for random RNA. Two possible mechanisms may ac¬ 
count for this regulation. Phosphorylation of N protein may 
alter its conformation and thus affect the RNA binding site(s). 
Stohlman et al. (69) suggested that phosphorylation of MHV N 
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protein could lead to substantial conformational changes in 
protein structure, and phosphorylation of human immunode¬ 
ficiency virus type 1 Rev protein was shown to alter its confor¬ 
mation and RNA binding properties (21). Our mass spectro¬ 
scopic data would support the hypothesis of Stohlman et al. 
(69), in that phosphorylation occurs proximal to RNA binding 
domains. Alternatively, phosphorylation of N protein could 
add negative charge to the RNA binding domains and thus 
reduce electrostatic interaction with target RNA. A similar 
mechanism has been proposed to regulate binding of potato 
virus A coat protein to RNA (26). While this would explain the 
overall slight binding difference between N nonphos and N phos 
with viral RNA, such an interaction probably does not account 
for the large difference in binding between N phos protein and 
random RNA. Although the role of the conserved core se¬ 
quence in N protein binding has been known for sometime, the 
mechanism by which this is achieved was not. Our kinetic study 
would suggest that the core sequence determines the specificity 
of IBV N protein binding with viral RNA by increasing the 
association rate. However, this is not the only site within the 
IBV leader that promotes high-affinity binding with N protein, 
and the possibly exists that these structures act in concert to 
promote optimal binding. 
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